2 |
A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Parsing Early Modern English for Linguistic Search
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2022)
|
|
Abstract:
This work addresses the question of whether the output of a state-of-the-art parser is accurate enough to support research in theoretical linguistics. In order to build reliable models of syntactic change, we aim to eventually parse the 1.5-billion-word Early English Books Online (EEBO) corpus. But since EEBO is not yet parsed, we begin by constructing and testing a parser on the 1.7-million-word Penn-Helsinki Parsed Corpus of Early Modern English (PPCEME). In order to obtain robust results, we define an 8-fold split on PPCEME. We then evaluate the parser with evalb and, more relevantly for us, with a task-specific metric - namely, its accuracy in parsing 6 sentence types necessary to track the rise of auxiliary do (as in They did not come vs. its historical precursor They came not). Retrieving the relevant sentences from the gold and test versions with CorpusSearch queries, we find that the parser's accuracy promises to be sufficient for our purposes. A remaining concern is the variability of the output, which we plan to address with three pieces of future work sketched in the conclusion.
|
|
Keyword:
Computational Linguistics; historical linguistics; parsing; syntax
|
|
URL: https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1246&context=scil https://scholarworks.umass.edu/scil/vol5/iss1/13
|
|
BASE
|
|
Hide details
|
|
4 |
Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Bare Infinitives and External Arguments
|
|
|
|
In: North East Linguistics Society (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Incremental Phrase Structure Generation and a Universal Theory of V2
|
|
|
|
In: North East Linguistics Society (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Language Variation in Appalachia: A Special Case of Sentence Meaning
|
|
|
|
In: ASA Annual Conference (2019)
|
|
BASE
|
|
Show details
|
|
10 |
Treebank-3
|
|
|
|
In: microphone speech, newswire, telephone speech, transcribed speech, varied (2013)
|
|
BASE
|
|
Show details
|
|
|
|